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ABSTRACT 

The presence of double-peaked/multicomponent emission line profiles in spectra of 
galaxies is commonly done by visually inspection. However, the identification of com- 
plex emission line profiles by eye is unapproachable for large databases such as the 
Sloan Digital Sky Survey (SDSS) or the integral field spectroscopy surveys of galaxies 
(e.g. CALIFA or MaNGA) . We describe a quick method involving the cross-correlation 
technique for detecting the presence of complex (double-peaked or multiple compo- 
nents) profiles in the spectra of galaxies, deriving simultaneously a first estimation 
of the velocity dispersions and radial velocities of the dominant gaseous component. 
We illustrate the proposed procedure with the well-known complex [OIII]AA4959, 5007 
profiles of the central region of NGC 1068. 
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1 INTRODUCTION 



Early spectroscopic studies of galaxies showed that profiles 
of emission lin es could presen t asymmetries, shoulders, or 
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double peaks ([Heckman et all 198 ll ; iPelat fc Aliorr] I 1980l : 
iGlaspev et ahll 1 9 76l : I S e vf er til 1 943i . e.g.). These features come 
from the light of several gaseous systems with different kine- 
matics that is integrated along the observer's light of sight. 
They have been interpreted as due to rotating ga seous disks, 
outflows /inflows or dual a ctive galact i c nuc l ei (IShen et al 
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1996, 



e-g-. 



Systematic search of double-peaked emission line pro- 
files in larg e spectra databas es (e.g. Sloan Digital Sky Sur- 
vey, SDSS (|York et alj l2000h ) have been performed using 
different selection criteria, bu t the final confirmation is usu- 
ally done by visual inspection (iGe et alj20l3 ; iPilvugin et al.l 



I 20121 ; ISmith et al.ll 20101 ; iLiu et al.ll 201Ch . In this work 



present an automatic procedure to detect multi-component 
emission line profiles in large databases of spectra of galaxies 
based on the symmetry of the cross-correlation function. 

The cross-correlation (C-C hereafter) technique has 
been ext ensively used in astronomy to i nfer radial ve- 
locities (|Anglada-Escude fc Butlerl |2012| ; IWestfall et~ai1 
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compares a problem spectrum with a reference spectrum or 
template. The C-C of two spectra analyses the similarity 
between one spectrum and a wavelength (or velocity) 
shifted version of the other, as a function of this wavelength 
(or velocity) shift. The C-C function only contains the 
frequencies that are common to both spectra. Therefore, 
the C-C function provides a clear indication of the shift 
at which the two spectra are most similar and also a 
quantitative measure of that similarity. As long as the 
largest peak of the C-C function is symmetric, it might be 
used to derive the shift of the problem spectrum as well as 
its velocit y dispersion in conjun ction with the width of the 
template IjTonrv fc Davisll 19791 ). When the two spectra are 
the same, the C-C function becomes the auto-correlation. 

We propose a methodology for searching multi- 
component/double-peaked emission line profiles in the spec- 
tra of galaxies based on the deviation from symmetry of 
the peak of the C-C function. Detai l s on the C-C technique 
can be found in iFurenlid fc Furenlidl fp990l ). lTonrv fc Davis! 
| 19791 ) and references therein. The C-C algorithm might 
be used in the search for binary active galactic nuclei 
in large spectra databases (e.g. SDSS). It may be also 
used to locate spaxels with spectra showing multicompo- 
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nent emission line profiles in Integral Field Spectroscopic 
(IFS) surveys of galaxies (e.g. the on-goi ng Calar Alto 
Lega cy Integral Field Area Survey, CALIFA ll Sanchez et al.l 
|2012| ) or The Mapping Nearby Galaxies at AP O, MANGA 
( http : / /dunlap ?utoronto.ca/research/surveys / [ ) ) . 

2 CROSS-CORRELATION TECHNIQUE FOR 
SEARCHING DOUBLE-PEAKED LINE 
PROFILES 

2.1 Cross-correlation function shape traced by 
bisectors 

The procedure developed in this work is based on the sym- 
metry of the C-C function nearby its main peak. The esti- 
mation of the shift and velocity dispersion of the spectra of 
galaxies might be obtained by fitting a sm ooth symmetric 
functi on to the peak of the C-C function l|Tonrv fc Davis! 
I 19791) . However, C-C is not an even function and therefore, 
this approach is only valid when the shapes of the spectral 
features in the problem spectrum are similar to those in the 
reference spectrum. Emission lines in the spectra of galaxies 
are commonly assumed to be fitted by Gaussian profiles. If 
single Gaussian profiles are assumed for emission lines in the 
template and problem spectra, the velocity shift and veloc- 
ity dispersion of the problem spectrum can be derived by 
fitting a single Gaussian to the peak of the C-C function 
(see Fig. [T^,b) . However, if the emission line profiles in the 
reference and in the problem spectrum are significantly dif- 
ferent, the peak of the C-C function will be asymmetric (see 

Fig.nb,d). 

The reference spectrum must have a large signal-to- 
noise ratio and a spectral resolution similar to or better than 
the problem spectrum. As the C-C of two spectra is similar 
to their convolution, the spectral resolution of the C-C func- 
tion might be significantly reduced during the C-C opera- 
tions. This results in a significant reduction in the capability 
of detecting double-components (see Fig. [T]i). However, the 
template can be selected to have a negligible velocity disper- 
sion compared to the problem spectrum. Indeed, a variation 
of the C-C technique using delta functi ons has been success- 
fully u sed for detection of binary stars (|Furenlid fc Furenlidl 
I 19901 ). In this case, the C-C spectral resolution, and hence 
the capability of detecting double-components, strongly de- 
pends on the problem spectrum spectral resolution. More- 
over, the shape of the C-C peak will correspond to the aver- 
age shape of the line profiles in the problem spectrum (fig- 
uresQJ andQJ). Therefore, the presence of double or multiple 
components in emission line profiles is reduced to study the 
shape of the C-C peak profile. 

The C-C peak profile can be characterized by tracing its 
bisector. Bisector shapes are commonly used in the analysis 
of the mechanis ms that cause asymm etries and variations 
in stellar spectra (|Bastiirk et al.ll201ll . e.g.). The bisector of 
a symmetric profile should remain at constant wavelength 
(or velocity) for all parts of the profile, dividing it into two 
equal parts; any existing asymmetry between the base and 
the peak of the line will remain reflected in the shape of 
the bisector. Bisectors for any profile can be constructed by 
connecting the midpoints of horizontal line segments span- 
ning the width of the profile at a number of intensity po- 
sitions inside the profile. The comparison of the C-C peak 



profile bisector (Vb) with the C-C peak velocity (V p , the 
remained value as if it were symmetric) gives a reference 
value to identify the presence of various gaseous compo- 
nents in the profiles. This procedure does not infer the ac- 
tual number of gaseous components forming the observed 
profile, but an evidence of the presence of various gaseous 
components (at least two). The sign of Vb-V p also gives in- 
formation about the equivalent wavelength (velocity) of the 
double/multiple components relative to the dominant com- 
ponent (wavelength or velocity at the C-C peak): if the sign 
is positive, the secondary (or equivalent) component is red- 
shifted respect to the dominant component. A positive de- 
tection of double/multiple components in a galaxy spectrum 
will depend on the signal-to-noise of the C-C function and 
hence on the signal-to-noise of the problem spectrum. Fig- 
ure [2] shows the bisectors of the C-C peak functions for the 
examples in figure [T] Tracing the bisector on a single emis- 
sion line profile would also provide evidences of the pres- 
ence of double/multiple components forming that profile. 
However, any observational or instrumental signature (e.g. 
cosmic-ray) not properly removed during the data reduction 
process and affecting the selected single emission line could 
result in an asymmetric line bisector and hence in an spuri- 
ous double/multiple component detection. Including several 
emission lines in the selected cross-correlation spectral range 
will smoothed any undesirable feature affecting a single-line 
since the shape of C-C peak function will correspond to their 
average shape. Moreover, as the C-C function also provides 
a quantitative measure of the similarity between the prob- 
lem and template, other advantage of tracing the bisector 
on the C-C peak function instead of on single emission line 
profiles is that any shape of the line profiles can be selected 
to generate the template. Here, we have assumed gaussian 
profiles for the emission lines to create the template but any 
other profile shape could be assumed (e.g. Lorentz profiles). 



2.2 Implementation of the procedure 

The C-C technique uses the information contained in all the 
lines of the selected spectral range for calculating the C-C 
function. The resulting shape of the C-C peak corresponds 
to an average shape of the line profiles in the problem spec- 
trum. 

In order to calculate the C-C function, it is necessary 
to prepare the problem spectrum following these steps: 

1. The ends of the C-C spectral range are selected at the 
continuum to avoid discontinuities at the edges. 

2. The data are divided by the continuum to remove any 
residual curvature in the continuum. 

3. In order to minimize edge effects, the continuum is sub- 
tracted (just subtract the unity when step 2 was performed). 

When a stellar fit had been carried out previously and 
properly subtracted from the original spectra, steps 2 and 3 
can be stripped. 

The reference spectrum is generated including as many 
delta functions as single emission lines are expected in the 
spectral range selected to apply the C-C technique. The 
wavelength (or velocity) position of these delta functions 
corresponds to the center of the emission lines. Different 
weights (in flux) can be given to the delta functions to 
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Figure 1. Example of the cross-correlation technique applied in the [OIII] AA4959, 5007 spectral range, assuming Gaussian emission line 
profiles and a redshift of about 6610 km/s. Spectral resolution for this model is 1650. (a) Reference spectrum (blue-dashed lines) shifted 
at the assumed redshift, and problem spectrum (black line) at redshift-400 km/s. Velocity dispersions for problem and template spectrum 
are 115 km/s. (b) The cross-correlation function of the spectra in (a). The green-dashed lines correspond to the Gaussian profile fit to 
the peak of the C-C function. The Gaussian centroid (at -390 km/s) and full-width-half-maximum (384.17 km/s) provides the velocity 
shift between problem and template and the estimation of the problem velocity dispersion, respectively, (c) Problem spectrum (black 
line) showing double components. The brightest component is described by the problem spectrum (V S ys=6210 km/s and o-=115 km/s) 
in (a), and secondary component is at V S y S =6710 km/s with a cr=115 km/s. Both components have been selected to be clearly resolved. 
Bluc-dashcd line is the template (the same template than in (a)), (d) The C-C function of the problem and template spectra in (c). 
The C-C peak clearly has an asymmetric shape although a double-peak is not resolved due to the degradation of the spectral resolution 
in the calculation of the C-C function. The Gaussian fit to the asymmetric C-C peak provides a velocity shift between template and 
problem spectra of -289 km/s, and the estimation of the velocity dispersion for the problem is 235 km/s. (e) The velocity dispersion of 
the template (blue line) is ~ 8 km/s, negligible compared to the velocity dispersion of the emission lines (ct=115 km/s) in the problem 
spectrum (black line), (f) The C-C function of the problem and template spectra in (e). The asymmetric C-C peak profile is now resolved 
in two peaks. Gaussian fit to the peak of the C-C function provides and estimation of redshift and velocity dispersion of -311 km/s and 
225 km/s, respectively, for the problem spectrum. 
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Figure 2. Bisectors (blue dashed-lines) of the C-C peak functions in Fig.[T^,. Black dotted line corresponds to the peak velocity dividing 
the profile in two parts. When the emission lines profiles in the template and problem spectrum are similar (Gaussian profiles are 
assumed) black dotted line divides the C-C peak function profile in two symmetric parts, fn this case, bisector (blue dashed-lines) and 
peak velocity extension to different intensities (black dottcd-lines) are similar (see (a)). When asymmetries due to double components 
are presented in the problem spectrum, the bisector of the C-C peak function is twisted and differs from the velocity at the peak (see (b) 
and (c)). The difference between the velocity at the peak of the C-C function and the bisector at a determined intensity level indicates 
the presence of asymmetries (double or multiple components) in the problem spectrum profile. The sign of this difference indicates if 
these asymmetries are blue or red-shifted respect to the dominant component (peak of the C-C function). 



account for fix intensity ratio between emission lines ac- 
cording to atomic parameters. Noise contribution is not de- 
sirable to be included in the template to avoid degrada- 
tion of the signal-to-noise when the C-C function is com- 
puted. Both template and problem spectra are normalized 
to the maximum of the brightest emission line in the se- 
lected wavelength. When the reference has been generated 
and the problem spectrum has been rectified, the cross- 
correlation function can be computed. The maximum of the 
cross-correlation function is located and a Gaussian is fitted 
to this peak to obtain a better precision in the determi- 
nation of the velocity shift between template and problem 
spectra (V p ) than the pixel size/resolution element. The bi- 
sector of the C-C peak function can be traced by calculating 
the midpoints of horizontal lines segments slicing the C-C 
peak profile at a number of intensities (e.g. twenty inten- 
sity positions, from 100% to 5% in steps of 5%). In order to 
detect asymmetries, we can compute the difference between 
V p and the bisector at different intensities (e.g. V&(i), for 
i=l,N, being N the total number of intensity positions). We 
will have as many V p -Vt as intensities have been selected to 
trace the bisector. If AV denotes the size of the resolution 
element, then: 



profile presents blue and red asymmetries depending on the 
intensity level (multi-asymmetries hereafter) , indicating the 
presence of multiple gaseous components forming the ob- 
served emission line profiles. 

Obviously, the level of noise in the problem spectrum 
affects the detection of asymmetries. In this sense, a signal- 
to-noise threshold is defined to reduce the number of false 
positive detections of double/multiple components on emis- 
sion line profiles in the spectra of galaxies. Removing bisec- 
tor lower levels might also help to avoid false positive detec- 
tions due to poor signal-to-noise of the spectra. In practice, 
we consider a positive asymmetry detection (blue, red, or 
multi-asymmetries) when N as!/m is equal or larger than 2, 
that is, at least two intensity levels tracing the bisector of 
the C-C peak function must satisfied that |V(,-V p (i)| > AV. 



2.3 Application to integral field spectroscopy 

In this section, we present some examples to illustrate the 
results from the proposed procedure for searching double 
or multiple gaseous components in the spectra of galaxies 
applied on IFS data. 



- A blue asymmetry is detected when V(,-V p < — AV 

- A red asymmetry is detected when Vb-V p > AV 

For the implementation of the procedure, we calculate 
the number of |Vb-V p (i)| values (for i=l,N, being N the total 
number of intensity positions) larger than AV (N as!/m ) and 
also the number of V&-V p (i) smaller than -AV (Nw ue ) and 
larger than AV (N rc( j), considering that: 



When N Q 



=Ni,; ue a pure blue asymmetry in the pro- 



file is detected, and at least two components are present. 

• WhenN a sym— Nred a pure red asymmetry in the profile 
is detected, and at least two components are present. 

• When N asym > N re(i or/and N aaym > N Wue then the 



2.3.1 The data 

We have used the IFS data of the central 24"x20" 
of the Seyfert 2 galaxy NGC 1068 presented in 
ICarcia-Lorenzo et alj ( 19991 ). NGC 1068 is the nearest and 
brightest example of a barred galaxy with an active galac- 
tic nucleus. The presence of several kinematically distinct 
gaseous components in the central region of NGC 1068 
is evident from the large amount of emission-line pro- 



(Ozaki 2009: Emscllcm et all 1 2006: Gcrssen et al.1 


2006; 


Ishigaki et al.l 


2004: [Groves et al.ll 2004 ICecil et al.l 


2002; 


iGarcfa-Lorenzo et al.l 19991: Arribas et al.ll 19961: Cecil et al. 
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I 199d : IPelat fc Alioml I 19801 ). Careful vis ual examination 
(see F ig. [3| and description of the spectra l|Emsellem et al.l 
2006) [e.g] revealed the presence of a minimum of three dif- 
ferent gaseous systems in NGC 1068. When studying IFS 
data, the visual inspection of the spectra is a tedious task 
but accessible for examining a several hundred of spectra. 
This is not the case for the large amount of spectra pro- 
vided by IFS surveys (e.g. CALIFA or MaNGA surveys) of 
galaxies. 

Figure [3] shows the [OIII]AA4959, 5007 line profiles at 
each observed position of N GC 1068 as a spectra diagram 
l|Garcfa- Lorenzo et al ] | 19991 ). These profiles offer a large va- 
riety of examples to test the developed algorithm. 

2.3.2 Results for NGC 1068 

Applying the proposed procedure of studying the symme- 
try of the C-C peak function explained in section §2 to the 
spectra from the central region of NGC 1068, we can point 
out the following: 

• Only 12% of the emission lines profiles in the central 
24"x20" of the Seyfert 2 galaxy NGC 1068 (Fig.© seems 
to present symmetric profiles. The percentage of symmetric 
[OIIIJAA4959, 5007 profiles is 24.5% for those spectra with a 
signal-to-noise larger than 200. 

• 6 % of the total number of spectra presents pure red- 
asymmetries. Red asymmetries are detected in the 26% of 
spectra with a signal-to-noise larger than 200. 

• 51 % of the total number of spectra presents pure blue- 
asymmetries. This percentage is 31.5% for those spectra 
with a signal-to-noise ratio > 200. 

• 31 % of the spectra in the central region of NGC 1068 
present asymmetries, to the blue or red depending on the 
bisector intensity level. In the case of spectra with a signal- 
to-noise larger than 200, the percentage of multi-asymmetric 
emission line profiles is 18%. 

Examples of spectra (the same spectra than in Fig. 7 
in Garcfa-Lorenzo et al. (1999)) displaying asymmetries and 
the bisectors shape of the C-C peak function are shown in 
Fig. H The [OIII] AA4959, 5007 profiles in Fig. Ha) show a 
blue shoulder (corresponding to component 4b in Garcfa- 
Lorenzo et al. (1999) and the additional component in Em- 
sellem et al. (2006)). The shape of the C-C function for 
this spectrum (Fig. H^al) and (a2)) also presents a clear 
blue asymmetry that is traced by the shape of its bisector. 
The peak of the C-C function indicates a radial velocity 
of 1145 km/s for the dominant component in this profile, 
which is in agreement to the systemic velocity, 1144 km/s 
l|Emsellem et all I 20061) . The difference in velocity between 
the bisector at a 10% peak intensity level and the velocity at 
the peak of the C-C function (V ft (10%)-V p ) is -302 km/s, in- 
dicating a blue asymmetry. The [OIII] AA4959, 5007 emission 
lines in Fig. |3Jb) presents double-peaked [OIII] profiles that 
it is also reproduced in the C-C peak function. The radial ve- 
locity of the dominant component in this profile (velocity at 
the peak of the C-C function) is 1618 km/s (474 km/s larger 
than the systemic velocity), which was identified as compo- 
nent 4r in Garcfa-Lorenzo et al. (1999) and as the additional 
component in Emsellem et al (2006). The bisector of the C-C 
peak function clearly indicates the presence of a secondary 
bluer component in this profile. The difference Vt,(10%)-V p 



is -485 km/s. This secondary component was identified as 
component 1+3 in Garcfa-Lorenzo et al. (1999) and as the 
narrow component in Emsellem et al (2006). Emission lines 
in Fig. [4jc) show a peaked profile with a clear red shoulder 
that is reproduced by the shape of the C-C peak function bi- 
sector. In this case, V p =1124.52 km/s (component 1/narrow 
in Garcfa-Lorenzo et al. (1999)/Emsellem et al. (2006), re- 
spectively), while Vf,(10%)-V p = 209 km/s, indicating a red 
component identified as component 3 in Garcfa-Lorenzo et 
al. (1999) and broad component in Emsellem et al (2006). 
For this profile, the lower level of the bisector (Fig. |4fc2)), 
turns to the blue, V 6 (5%)-V 6 (10%) = -104 km/s, although 
still indicating a global red asymmetry (V(,(5%)-V p = 105 
km/s). This twist in the bisector indicates the presence of 
a faint additional component, that was identified as compo- 
nent 2 in Garcfa-Lorenzo et al. (1999). The visual inspection 
of emission line profiles in Fig. |4jd) showed single profiles 
with not signs of double-components. The C-C function is 
symmetric respect to its peak, being V p = 1071 km/s. Only 
at a 5% level of intensity of the C-C peak function, a blue 
asymmetry is detected by the proposed procedure (with a 
Vb(5%)-V p = -188 km/s), while for the rest of the levels, 
the difference V 6 (i)-V p (for i=10%,100% in steps of 5%) is 
always smaller than 10 km/s. This false positive asymme- 
try detection is due to the low signal-to-noise at the level 
in which the asymmetry has been detected. In practice, this 
case will be labeled as symmetric in the implemented algo- 
rithm due to the condition of having at least two intensity 
levels with positive detection for asymmetries. Figure [3] also 
indicates the identified asymmetries for the observed spec- 
tra with a signal-to-noise ratio larger than 200 in the central 
region of NGC 1068. 

The proposed procedure only indicates those spectra 
showing asymmetries due to double or multiple components, 
but the deprojection of the different gaseous systems needs 
any o ther procedure such as Gaussian fit l|Arribas et all 
I 1996h [e.g.1. 



3 CONCLUSIONS 

The identification of double/multiple component emission 
line profiles in spectra of galaxies is the first step for the sam- 
ple selection of many research topics (e.g. binary black holes, 
ionized gas kinematic deprojection) . This paper deals with a 
quick estimation of velocity dispersions, radial velocities of 
the dominant component and the detection of the presence 
of double or multiple components among a large set of spec- 
tra of a galaxy or galaxies using the cross-correlation tech- 
nique and the shape of the C-C peak function. The proposed 
procedure allows processing a large amount of problem spec- 
tra in a short period of time (in the order of minutes) using 
the peak, full-width-half-maximum and symmetry of the C- 
C peak function. 
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Figu re 3. Spectra diagra m of the [OIII] AA4959, 5007 emission lines at 463 positions on the central region of NGC 1068 
ijGarcia-Lorenzo et al.lT 19991 ). The spectra at each location are auto scaled to better show the profile shape. Emission lines nearer 
to the optical nucleus (taken to be the origin) are brighter than those farther out. The plotted spectral range is 4925-5075 A, the same 
selected for the application of the cross-correlation technique. Different colors indicate the asymmetry detected in those profiles with a 
signal-to-noise larger than 200 (red: red-asymmetry; blue: blue-asymmetry; magenta: multi-asymmetries; green: symmetric profiles). 
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